Goto

Collaborating Authors

 dialog-based interactive image retrieval


Dialog-based Interactive Image Retrieval

Neural Information Processing Systems

Existing methods for interactive image retrieval have demonstrated the merit of integrating user feedback, improving retrieval results. However, most current systems rely on restricted forms of user feedback, such as binary relevance responses, or feedback based on a fixed set of relative attributes, which limits their impact. In this paper, we introduce a new approach to interactive image search that enables users to provide feedback via natural language, allowing for more natural and effective interaction. We formulate the task of dialog-based interactive image retrieval as a reinforcement learning problem, and reward the dialog system for improving the rank of the target image during each dialog turn. To mitigate the cumbersome and costly process of collecting human-machine conversations as the dialog system learns, we train our system with a user simulator, which is itself trained to describe the differences between target and candidate images. The efficacy of our approach is demonstrated in a footwear retrieval application. Experiments on both simulated and real-world data show that 1) our proposed learning framework achieves better accuracy than other supervised and reinforcement learning baselines and 2) user feedback based on natural language rather than pre-specified attributes leads to more effective retrieval results, and a more natural and expressive communication interface.


Reviews: Dialog-based Interactive Image Retrieval

Neural Information Processing Systems

Update: Thank you for your feedback. Given your comments as well as the discussion with the other reviewers I have slightly adjusted my score to be more positive. I still stand by my comments below in that I think the work is interesting, but that the presentation in its current form is misleading. Assuming the paper will be accepted, I implore you to reconsider the presentation of this work, particularly with respect to claiming that this is a fully fledged dialogue system. The idea that the last image chosen represents a distillation of all previous rounds of dialogue is fanciful.


Dialog-based Interactive Image Retrieval

Guo, Xiaoxiao, Wu, Hui, Cheng, Yu, Rennie, Steven, Tesauro, Gerald, Feris, Rogerio

Neural Information Processing Systems

Existing methods for interactive image retrieval have demonstrated the merit of integrating user feedback, improving retrieval results. However, most current systems rely on restricted forms of user feedback, such as binary relevance responses, or feedback based on a fixed set of relative attributes, which limits their impact. In this paper, we introduce a new approach to interactive image search that enables users to provide feedback via natural language, allowing for more natural and effective interaction. We formulate the task of dialog-based interactive image retrieval as a reinforcement learning problem, and reward the dialog system for improving the rank of the target image during each dialog turn. To mitigate the cumbersome and costly process of collecting human-machine conversations as the dialog system learns, we train our system with a user simulator, which is itself trained to describe the differences between target and candidate images.